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<N Abstract 

Konig's theorem states that on bipartite graphs the size of a maximum matching equals the 
size of a minimum vertex cover. It is known from prior work that for every e > there exists a 
constant-time distributed algorithm that finds a (1 + e)-approximation of a maximum matching 
on 2-coloured graphs of bounded degree. In this work, we show — somewhat surprisingly — that 
no sublogarithmic-time approximation scheme exists for the dual problem: there is a constant 
S > so that no randomised distributed algorithm with running time o(logn) can find a (1 + 5)- 
approximation of a minimum vertex cover on 2-coloured graphs of maximum degree 3. In fact, a 
simple application of the Linial-Saks (1993) decomposition demonstrates that this lower bound 
is tight. 

Our lower-bound construction is simple and, to some extent, independent of previous tech- 
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niques. Along the way we prove that a certain cut minimisation problem, which might be of 
independent interest, is hard to approximate locally on expander graphs. 



> 

1 Introduction 

Many graph optimisation problems do not admit an exact solution by a fast distributed algorithm. 
This is true not only for most NP-hard optimisation problems, but also for problems that can be 
solved using sequential polynomial-time algorithms. This work is a contribution to the distributed 
approximability of such a problem: the minimum vertex cover problem on bipartite graphs — we call 
it 2-VC, for short. 

Our focus is on negative results: We prove an optimal (up to constants) time lower bound 
O(logn) for a randomised distributed algorithm to find a close-to-optimal vertex cover on bipartite 
2-coloured graphs of maximum degree A = 3. In particular, this rules out the existence of a 
sublogarithmic-time approximation scheme for 2-VC on sparse graphs. 

Our lower bound result exhibits the following features: 

- The proof is relatively simple as compared to the strength of the result; this is achieved 
through an application of expander graphs in the lower-bound construction. 

- To explain the source of hardness for 2-VC we introduce a certain distributed cut minimisation 
problem, which might have applications elsewhere. 

- Many previous distributed inapproximability results are based on the hardness of local symmetry 
breaking. This is not the case here: the difficulty we pinpoint for 2-VC is in the semi-global 
task of gluing together two different types of local solutions. 

- Our result states that Konig's theorem is non-local — see Section 1.3. 
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1.1 The LOCAL model 



We work in the standard LOCAL model of distributed computing [10, 17]. As input we are given 
an undirected graph G = (V, E). We interpret G as defining a communication network: the nodes 
V host processors, and two processors can communicate directly if they are connected by an edge. 
All nodes run the same distributed algorithm A. The computation of A on G starts out with every 
node v G V knowing an upper bound on n = \V\ and possessing a globally unique 0(logn)-bit 
identifier ID(u); for simplicity, we assume that V C {1, 2, . . . , poly(n)} and ID(v) = v. Also, we 
assume that the processors have access to independent (and unlimited) sources of randomness. The 
computation proceeds in synchronous communication rounds. In each round, all nodes first perform 
some local computations and then exchange (unbounded) messages with their neighbours. After 
some r communication rounds the nodes stop and produce local outputs. Here r is the running 
time of A and the output of v is denoted A(G, v). 

The fundamental limitation of a distributed algorithm with running time r is that the output 
A(G, v) can only depend on the information available in the subgraph G[v, r] C G induced on the 
vertices in the radius-r neighbourhood ball 

Ba(v,r) = {u G V : distG(v,u) < r}. 

Conversely, it is well known that an algorithm A can essentially discover the structure of G[v, r] in 
time r. Thus, A can be thought of as a function mapping r-neighbourhoods G[v, r] (together with 
the additional input labels and random bits on Bq(v, r)) to outputs. 

While the LOCAL model abstracts away issues of network congestion and asynchrony, this only 
makes our lower-bound result stronger. 

1.2 Our result 

Below, we concentrate on bipartite 2-coloured graphs G. That is, G is not only bipartite (which is a 
global property), but every node v is informed of the bipartition by an additional input label c(v), 
where c: V — > {white, black} is a proper 2-colouring of G. 

Definition 1. In the 2-VC problem we are given a 2-coloured graph G = (G, c) and the objective 
is to output a minimum-size vertex cover of G. 

A distributed algorithm A computes a vertex cover by outputting a single bit A(G, v) G {0, 1} 
on a node v indicating whether v is included in the solution. This way, A computes the set 
A(G) := {v G V : A(G, v) = 1}. Moreover, we say that A computes an a- approximation of 2-VC if 
A(G) is a vertex cover of G and 

\A(G)\ < a-OPT G , 

where OPTg denotes the size of a minimum vertex cover of G. 
Our main result is the following. 

Theorem 1. There exists a 5 > such that no randomised distributed algorithm with running time 
o(logn) can find a (1 + 5) -approximation of 2-VC on graphs of maximum degree A = 3. 

A matching time upper bound follows directly from the well-known network decomposition 
algorithm due to Linial and Saks [11]. 

Theorem 2. For every e > a (1 + e) -approximation of '2-VC can be computed with high probability 
in time 0(e _1 logn) on graphs of maximum degree A = 0(1). 
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Proof. The subroutine Construct Slock in the algorithm of Linial and Saks [11] computes, in time 
r = 0(e _1 logn), a set S C V with the following properties. Each component in the subgraph G[S] 
induced by S has weak diameter at most r, i.e., distc(^, v) < r for each pair u,v £ S belonging to 
the same component of G[S]. Moreover, they prove that, w.h.p., 

|S|>(l-e)n. 

Let C be a component of G[S]. Every node of C can discover the structure of C in time 0(r) 
by exploiting its weak diameter. Thus, every node of C can internally compute the same optimal 
solution of 2-VC on C. We can then output as a vertex cover for G the union of the optimal 
solutions at the components together with the vertices V \ S. This results in a solution of size at 
most 

O pT G[S] + en. 

But since OPTc > \E\/A = Q(n) for connected G, this is a (1 + 0(e))-approximation of 2-VC. □ 



1.3 Konig duality 



The classic theorem of Konig (see, e.g., Diestel [3, §2.1]) states that, on bipartite graphs, the size of a 
maximum matching equals the size of a minimum vertex cover. A modern perspective is to view this 
result through the lens of linear programming (LP) duality. The LP relaxations of these problems 
are the fractional matching problem (primal) and the fractional vertex cover problem (dual): 



maximise x e 
subject to ^2 x e <1, V-u G V 

e: -uGe 

x > 



minimise 



subject to ^2 Vv > 1) Ve G E 



v. v£e 

y > o 



In particular, on bipartite graphs, the above LPs do not have an integrality gap (see, e.g., Papadim- 
itriou and Steiglitz [15, §13.2]): among the optimal feasible solutions are integral vectors x G {0, 1} E 
and y G {0, 1} V that correspond to maximum matchings and minimum vertex covers. 

In the context of distributed algorithms, the following is known on (bipartite) bounded degree 
graphs : 

(1) Primal LP and dual LP admit local approximation schemes. As part of their general result, 
Kuhn et al. [7] give a strictly local (1 + e)-approximation scheme for the above LPs. Their 
algorithms run in constant time independent of the number of nodes. 

(2) Integral primal admits a local approximation scheme. Astrand et al. [1] describe a strictly 
local (1 + e)-approximation scheme for the maximum matching problem on 2-coloured graphs. 
Again, the running time is a constant independent of the number of nodes. 

(3) Integral dual does not admit a local approximation scheme. The present work shows — in 
contrast to the above positive results — that there is no local approximation scheme for 2-VC 
even when A = 3. 



1.4 Related lower bounds 

There are relatively few independent methods for obtaining negative results for distributed approxi- 
mation in the LOCAL model. We list three main sources. 
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Local algorithms. Linial's [10] lower bound Q(log*w) for 3-colouring a cycle together with 
the Ramsey technique of Naor and Stockmeyer [13] establish basic limitations on finding exact 
solutions strictly locally in constant time. These impossibility results were later extended to finding 
approximate solutions on cycle-like graphs by Lenzen and Wattenhofer [9] and Czygrinow et al. [2]. 
A recent work [4] generalises these techniques even further and proves that deterministic local 
algorithms in the LOCAL model are often no more powerful than algorithms running on anonymous 
port numbered networks. For more information on this line of research, see the survey of local 
algorithms [18]. 

Here, the inapproximability results typically exploit the inability of a local algorithm to break 
local symmetries. By contrast, in this work, we consider the case where the local symmetry is 
already broken by a 2-colouring. 

KMW-bounds. Kuhn, Moscibroda and Wattenhofer [6-8] prove that any randomised algorithm 
for computing a constant-factor approximation of minimum vertex cover on general graphs requires 
time Q(\/\ogn) and f2(log A). Roughly speaking, their technique consists of showing that a fast 
algorithm cannot tell apart two adjacent nodes v and u, even though it is globally more profitable 
to include v in the vertex cover and exclude u than conversely. 

The lower-bound graphs of Kuhn et al. are necessarily of unbounded degree: on bounded degree 
graphs the set of all non-isolated nodes is a constant factor approximation of a minimum vertex 
cover. By contrast, our lower-bound graphs are of bounded degree and they forbid close-to-optimal 
approximation of 2-VC. 

Sublinear-time centralised algorithms. Parnas and Ron [16] discuss how a fast distributed 
algorithm can be used as solution oracle to a centralised algorithm that approximates parameters 
of a sparse graph G in sublinear time given a randomised query access to G. Thus, lower bounds 
in this model of computation also imply lower bounds for distributed algorithms. In particular, 
an argument of Trevisan (presented in [16]) implies that computing a (2 — e)-approximation of a 
minimum vertex cover requires fi(logn) time on d- regular graphs, where d = d(e) is sufficiently 
large. 

We note that 2-VC is easy to approximate in this model: Nguyen and Onak [14] give a centralised 
constant-time algorithm to approximate the size of a maximum matching on a graph G. If we are 
promised that G is bipartite, the same algorithm approximates the size of 2-VC by Konig duality. 

2 Deterministic lower bound 

To best explain the basic idea of our lower bound result, we first prove Theorem 1 for a toy model 
that we define in Section 2.1; in this model, we only consider a certain class of deterministic 
distributed algorithms in anonymous networks. Later in Section 3 we will show how to implement 
the same proof technique in a much more general setting: randomised distributed algorithms in 
networks with unique identifiers. 

In the present section, we find a source of hardness for 2-VC as follows. First, we argue that 
any approximation algorithm for the 2-VC problem also solves a certain cut minimisation problem 
called Recut. We then show that Recut is hard to approximate locally, which implies that 2-VC 
must also be hard to approximate locally. 
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2.1 Toy model of deterministic algorithms 

Throughout this section we consider deterministic algorithms A running in time r = o(logn) that 
operate on input-labelled anonymous networks (G,£), where G = (V, E) and £ is a labelling of V. 
More precisely, we impose the following additional restrictions in the COCAC model: 

- The nodes of G are not given random bits as input. 

- The output of A is invariant under reassigning node identifiers. That is, if G' = (V',E') is 
isomorphic to G via a mapping /: V' — > V, then for v £ V, 

A(G,e,v)=A(G',£ofJ-\ v )). 

Put otherwise, the only symmetry breaking information we supply A is the radius-r neighbourhood 
topology together with the input labelling — the nodes are anonymous and do not have unique 
identifiers. 

We will also consider graphs G that are directed. In this case, the directions of the edges are 
merely additional symmetry-breaking information; they do not restrict communication. 

2.2 The Recut problem 

In the following, we consider partitions of V into red and blue colour classes as determined by a 
labelling I: V — > {red, blue}. We write d£ for the fraction of edges crossing the red/blue cut, i.e., 

Qg= e(r 1 (red),r 1 (blue)) 
\E\ 

Definition 2. In the Recut problem we are given a labelled graph (G,£) as input and the objective 
is to compute an output labelling (a recut) £ out that minimises d£ on t subject to the following 
constraints: (a) If £(V) = {red}, then £ out (V) = {red}, (b) If £(V) = {blue}, then £ out (V) = {blue}. 

In words, if we have an all-red input, we have to produce an all-red output, and if we have an 
all-blue input, we have to produce an all-blue output. Otherwise the output can be arbitrary. See 
Figure 1 for an illustration. 

Needless to say, the global optimum for an algorithm A would be to produce a constant output 
labelling £j± (either all red or all blue) having d£j± = 0. However, a distributed algorithm A can only 



input output of 'A optimum 




Figure 1: The Recut problem. In this example, we have used a simple distributed algorithm A to find 
a recut £ ut with a small boundary d£ olIt - a node outputs red iff there is a red node within distance r = 3 
in the input. While the solution is not optimal, in a grid graph the boundary will be relatively small. 
However, our lower bound shows that any fast distributed algorithm — including algorithm A — fails to 
produce a small boundary in some graph. 
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access the values of the input labelling i in its local radius-r neighbourhood: when encountering 
a neighbourhood v £ U C V with £(J7) = {red}, the algorithm is forced to output red at v to 
guarantee satisfying the global constraint (a) , and when encountering a neighbourhood v G U C V 
with i{U) = {blue}, the algorithm is forced to output blue at v to satisfy (b). Thus, if a connected 
graph G has two disjoint r-neighbourhoods U,U' Q V with i(U) = {red} and i(U') = {blue} the 
algorithm A cannot avoid producing at least some red/blue edge boundary. Indeed, the best we can 
hope A to achieve is a recut tj± of size dtj^ < e for some small constant e > 0. 

Discussion. The Recut problem models the following abstract high-level challenge in designing 
distributed algorithms: Each node in a local neighbourhood U C. V can, in principle, internally 
compute a completely locally optimal solution for (the subgraph induced by) U, but difficulties arise 
when deciding which of these proposed solution are to be used in the final distributed output. In 
particular, when the type of the produced solution changes from one (e.g., red) to another (e.g., 
blue) across a graph G one might have to introduce suboptimalities to the solution at the (red/blue) 
boundary in order to glue together the different types of local solutions. 

In fact, the Recut problem captures the first non-trivial case of this phenomenon with only two 
solution types present. One can think of the input labelling t as recording the initial preferences of 
the nodes whereas the output labelling ij± records how an algorithm A decides to combine these 
preferences into the final unified output. In the end, our lower-bound strategy will be to argue 
that any A can be forced into producing too large an edge boundary dl^ resulting in too many 
suboptimalities in the produced output. 

Next, we show how the above discussion is made concrete in the case of the 2-VC problem. 

2.3 The reduction 

We call a graph G tree-like if all the r-neighbourhoods in G are trees, i.e., G has girth larger than 
2r + 1. Furthermore, if G is directed, we say it is balanced if in-degree(t> ) = out-degree(v) for all 
vertices v. We note that a deterministic algorithm A produces the same output on every node 
of a balanced regular tree-like digraph G, because such a graph is locally homogeneous: all the 
r-neighbourhoods of G are pairwise isomorphic. 

Using this terminology we give the following reduction. 

Theorem 3. Suppose A computes a (1 + e)- approximation of 2-VC on graphs of maximum degree 
A = 3. Then, there is an algorithm (with running time r) that finds a recut £^ of size dtj± = 0(e) 
on balanced 4-regular tree-like digraphs. 

The proof of Theorem 3 follows the usual route: We give a local reduction (i.e., one that can 
be computed by a local algorithm) that transforms an instance (G, I) of Recut into an instance 
IL(G,£) of 2-VC. Then we simulate A on the resulting instance and map the output of A back to a 
solution of the Recut instance (G,£). 

Let G = (V, E) be a balanced 4-regular tree-like digraph and let t: V — > {red, blue} be a labelling 
of G. The instance H{G, i) is obtained by replacing each vertex vgVby one of two local gadgets 
depending on the label £(v). We first describe and analyse simple gadgets yielding instances of 
2-VC with A = 4; the gadgets yielding instances with A = 3 are described later. 

Red gadget. The red gadget replaces a vertex v £ V by two new vertices w v (white) and b v (black) 
that share a new edge e v := {w v , b v }. The incoming edges of v are reconnected to w v , whereas the 
outgoing edges of v are reconnected to b v . 
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Note that the 2-VC instance H(G, red) (where we denote by red the constant labelling v i— > red) 
contains {e v : v G V} as a perfect matching. Since (G, red) is locally homogeneous, the solutions 
output by A on the endpoints of e v are isomorphic across all v. Assuming e < 1 it follows that the 
algorithm A must output either the set of all white nodes or the set of all black nodes on T1(G, red). 
Our reduction branches at this point: we choose the structure of the blue gadget to counteract this 
white/black decision made by A on the red gadgets. We describe the case that A outputs all white 
nodes on n(G, red); the case of black nodes is symmetric. 




red input blue input w' v 



Blue gadget. The blue gadget replacing v G V is identical to the red gadget with the exception 
that a third new vertex w' v (white) is added and connected to b v . 

Similarly as above, we can argue that A outputs exactly the set of all black nodes on the instance 
n(G, blue). This completes the description of II. 

Next, we simulate A on U(G,£). The output of A is then transformed back to a labelling 
£4: V — > {red, blue} by setting 

@a( v ) = blue -4=>- the output of A contains only the black node b v at the gadget at v. 

Note that £4 satisfies both feasibility constraints (a) and (b) of Recut. It remains to bound the 
size dH.A of this recut. 




Recut analysis. Call a red vertex v in (G, tj^) bad if v has a blue out-neighbour u. By the 
definition of u £j^(u) = blue", the vertex cover produced by algorithm A does not contain the white 
node w u . Thus to cover the edge (b v ,w u ), the vertex cover has to contain the black node b v . But by 
the definition of u £a( v ) = r ed", we must have w v or w' v in the solution as well. Hence, at least two 
nodes are used to cover the gadget at v, which is suboptimal as compared to the minimum vertex 
cover {b v : v G V}, which uses only one node per gadget. This implies that we must have at most 
e\V\ bad vertices as A produces a (1 + e)-approximation of 2-VC on IL(G,£). 
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On the other hand, exactly half of the edges crossing the cut £j± are oriented from red to blue 
since G is balanced. Each bad vertex gives rise to at most two of these edges, so we have that 
di_A ■ \E\/2 < 2e\V\ which gives dlj± < 2e, as required. This proves Theorem 3 for A = 4. 

Gadgets for A = 3. The maximum degree used in the gadgets can be reduced to 3 by the 
following modification. The red gadget replaces a vertex v E V by a path of length 3. 




red input blue input w' v 



Again, to achieve a 1.499- approximation of 2-VC on II(G, red) the algorithm A has to make a 
choice: either leave out the middle black vertex or the middle white vertex from the vertex cover. 
Supposing A leaves out the middle black, the blue gadget is defined to be identical to the red gadget 
with an additional white vertex connected to the middle black one. 

After simulating A on an instance H(G, £) we define £j\_( v ) = blue iff A outputs only black nodes 
at the gadget at v. The recut analysis will then give dtj± < 4e. 

2.4 Recut is hard on expanders 

Intuitively, the difficulty in computing a small red/blue cut in the Recut problem stems from 
the inability of an algorithm A to overcome the neighbourhood expansion of an input graph in 
r = o(logn) steps — an algorithm cannot hide the red/blue boundary as the radius-r neighbourhoods 
themselves might have large boundaries. 

To formalise this intuition, we use as a basis for our lower-bound construction an infinite family 
T of 4-regular 5-expander graphs, where each G = (V, E) E T satisfies the edge expansion condition 

e(S,V \ S) > 8- \S\ for all S C V, \S\ < n/2. (1) 

Here, e(S, V \ S) is the number of edges leaving S and 5 > is an absolute constant independent 
of n = |V|. On such graphs it is enough for us to force an algorithm A to output a nearly balanced 
recut £yx having both colour classes close to n/2 in size. This is because if the number of, say, the 
red nodes is 

I^Cred)) = n/2- o(n), 
then the expansion property (1) implies that 

8l A > 8/ A -o(l). 

That is, A computes a recut of size 0,(8). 

Indeed, the following simple fooling trick makes up the very core of our argument. 

Lemma 4. Suppose A produces a feasible solution for the Recut problem in time r = o(logn). 
Then for each 4-regular graph G there exists an input labelling for which A computes a nearly 
balanced recut. 

Proof. Fix an arbitrary ordering v\ , v<i , . . . , v n for the vertices of G and define a sequence of labellings 
£°, £ , . . . , £ n by setting £ l (vj) = blue iff j < i. That is, in £° all nodes are red, in £ n all nodes are 
blue, and £ % is obtained from £ l ~ 1 by changing the colour of vi from red to blue. 
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When we switch from the instance (G, £ l ~ l ) to (G,£ l ) the change of Vi's colour is only registered 
by nodes in the radius-r neighbourhood of Vi. This neighbourhood has size \Bq(vi, r)| < 5 r = o(n), 
and so the number of red nodes in the outputs t^ 1 and £\ of A can only differ by o(n). As, by 
assumption, we have that A computes the labelling £^ = red on (G,£°) and the labelling P\ = blue 
on (G,£ n ), it follows by continuity that some labelling in our sequence must force A to output 
n/2 — o(n) red nodes. □ 

We now have all the ingredients for the lower-bound proof: We can take <5 = 2 - y/3 if we choose 
T to be the family of 4-regular Ramanujan graphs due to Morgenstern [12]. These graphs are 
tree- like, as they have girth 0(logn). They can be made into balanced digraphs since a suitable 
orientation can always be derived from an Euler tour. Thus, T consists of balanced 4-regular 
tree-like digraphs. Lemma 4 together with the discussion above imply that every algorithm for 
Recut produces a recut of size Q(5) on some labelled graph in J 7 . Hence, the contrapositive of 
Theorem 3 proves Theorem 1 for our deterministic toy algorithms. 

3 Randomised lower bound 

Even though our model of deterministic algorithms in Section 2 is an unusually weak one, we 
can quickly recover the standard LOCAL model from it by equipping the nodes with independent 
sources of randomness. In particular, as is well known, each node can choose an identifier uniformly 
at random from, e.g., the set {1,2,..., n 3 }, and this results in the identifiers being globally unique 
with probability at least 1 — 1/n. 

When discussing randomised algorithms many of the simplifying assumptions made in Section 2 
no longer apply. For example, a randomised algorithm need not produce the same output on every 
node of a locally homogeneous graph. Consequently, the homogeneous feasibility constraints in the 
Recut problem do not strictly make sense for randomised algorithms. 

However, we can still emulate the same proof strategy as in Section 2: we force the randomised 
algorithm to output a nearly balanced recut with high probability. Below, we describe this strategy 
in case of the easy-to-analyse "A = 4" gadgets with the understanding that the same analysis can 
be repeated for the "A = 3" gadgets with little difficulty. 

3.1 Repeating Section 2 for randomised algorithms 

Fix a randomised algorithm A with running time r = o(logn) and let G = (V,E), n = | V|, be a 
large 4-regular expander. 

Again, we start out with the all-red instance. We denote by W and B the number of black and 
white nodes output by A on n(G, red). As each of the edges e v must be covered, we have that 

W + B > n. 

Hence, by linearity of expectation, at least one of E[W] > n/2 or E[B] > n/2 holds. We assume 
that E[W] > n/2; the other case is symmetric. 

In reaction to A preferring white nodes, the blue gadgets are now defined exactly as in Section 2 
with an additional white vertex. Furthermore, for any input £: V — > {red, blue} we interpret the 
output of A on U(G,£) as defining an output labelling £jy of V, where, again, £a(v) = blue iff 
A outputs only the black node at the gadget at v. This definition translates our assumption of 
E[W] > n/2 into 

E[22(red)] > n/2, (2) 
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where R(£) := |^_ 4 1 (red)| counts the number of gadgets (i.e., vertices of G) relabelled red by A on 
U(G,£). 

If A relabels a blue gadget red, it must output at least two nodes at the gadget. This means 
that the size of the solution output by A on II(G, blue) is at least n + i?(blue). Thus, if A is to 
produce a 3/2- approximation on H(G, blue) in expectation, we must have that 

E[i?(blue)] < n/2. (3) 

The inequalities (2) and (3) provide the necessary boundary conditions (replacing the feasibility 
constraints of Recut) for the argument of Lemma 4: by continuously changing the instance (G, red) 
into (G, blue) we may find an input labelling £* achieving 

B[R(£*)} = n/2 - o(n). (4) 

It remains to argue that A outputs a nearly balanced recut not only "in expectation" but also 
with high probability. 



3.2 Local concentration bound 

Focusing on the instance H(G,£*) we write R = R(£*) and 

R=J2 x v, (5) 

where X v 6 {0, 1} indicates whether A relabels the gadget at v red. 

The variables X v are not too dependent: the 2rth power of G, denoted G 2r , where u,v G V 
are joined by an edge iff Bc(v,r) n Bc{u,r) ^ 0, is a dependency graph for the variables X v . 
Every independent set / C V in G 2r corresponds to a set {X v } ve j of mutually independent random 
variables. Since the maximum degree of G 2r is at most max„ \Bc{v : 2r)\ = o(n), this graph can 
always be partitioned into x(G 2r ) = o(n) independent sets. 

Indeed, Janson [5] presents large deviation bounds for sums of type (5) by applying Chernoff- 
Hoeffding bounds for each colour class in a x(G 2r )-colouring of G 2r . For any e > 0, Theorem 2.1 in 
Janson [5], as applied to our setting, gives 

Pr(R > B[R] + en) < exp (~ 2 ^Sy^) °> as n ~^ 00 > ( 6 ) 

and the same bound holds for Pr [R < ~E[R] — en). That is, R is concentrated around its expectation. 

In conclusion, the combination of (4) and (6) implies that, for large n, the algorithm A outputs 
a nearly balanced recut on U(G,£*) with high probability. By the discussion in Section 2, this 
proves Theorem 1. 
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